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1  Introduction 


A  fundamental  issue  of  theoretical  computer  science  is  the  degree  to  which  randomness  helps  in 
computation.  In  many  cases,  the  most  natural  algorithm  to  solve  a  problem  involves  randomness. 
Often,  however,  it  is  possible  to  convert  a  randomized  algorithm  into  a  deterministic  one. 

For  many  applications  [K\V,L1,ABI],  the  problem  of  removing  randomness  from  an  algorithm 

can  be  solved  by  finding  an  A'  =<  A'i, . Y„  >  such  that  F( X)  >  £[F(A')],  for  some  function 

F  and  some  sample  space  A'  over  which  the  expectation  is  to  be  computed.  The  problem  is  then 
how  best  to  find  a  good  sample  point  (e.g.,  an  A'  such  that  F(X)  >  E[F(X)})  in  S.  If  the  space 
of  sample  points  is  small  (e.g..  polynomial),  then  this  can  be  accomplished  by  brute  force;  namely, 
we  could  try  all  points  until  we  get  a  good  one.  for  one  must  exist.  However,  typical  sample  spaces 
are  larger  than  polynomial,  and  brute  force  is  too  expensive.  In  such  situations,  the  only  general 
method  available  is  due  to  Raghavan  and  Spencer  [Rag.S].  Their  method  works  by  setting  the  A',’s 
one  by  one  in  such  a  way  as  to  not  decrease  the  conditional  expectation  (e.g.,  setting  Ar,,+i  so 

that  F’[F(A")  I  A'i . A',+i]  >  E[F(X)  |  A’i . A',]).  The  main  difficulty  with  this  approach  is 

computing  the  condi* ional  expectations  —  the  ability  to  do  so  determines  when  the  method  can 
and  cannot  be  used. 

Unfortunately,  the  Spencer/Raghavan  method  is  inherently  sequential.  Moreover,  the  running 
time  is  at  least  n .  Since  the  best  time  one  could  hope  for  is  logarithmic  in  the  size  of  the  sample 
space,  for  large  probability  spaces  this  is  probably  as  good  as  one  can  get;  yet,  for  smaller  spaces, 
n  is  far  from  optimal.  Tite  only  improvement  to  this  approach  was  by  Luby  [L2]  who  showed 
how  to  search  in  time  logarithmic  in  the  size  of  the  sample  space  for  the  special  case  of  pairwise 
independence,  thmeby  improving  the  processor  efficiency  of  several  NC  algorithms  (A  +  1  vertex 
coloring,  MIS,  and  maximal  matching)  from  n2(n  +  m)  to  n  +  m. 

In  this  paper,  we  show  how  to  search  in  time  logarithmic  in  the  size  of  the  sample  space  for  a  wide 
range  of  functions  F  and  arbitrarily  large  sample  spaces.  As  a  result,  we  can  prove  substantially 
stronger  results  than  is  possible  with  the  Lubv  method.  In  particular,  we  are  able  to  derive  NC 
algorithms  for  several  problems  that  were  not  previously  known  to  be  in  NC,  and  we  can  search 
(Iog':  n)-wise  independent  ?)log  "-size  sample  spaces  in  NC. 

In  Section  2,  we  demonstrate  how  our  techniques  apply  to  the  problem  of  set  discrepancy.  In 
this  problem,  we  are  given  a  set  of  n  points  and  n  subsets  of  at  most  A  of  these  points,  and  we  want 
to  color  the  points  0  and  1  so  that  the  discrepancy,  or  maximum  difference  between  the  number  of 
0’s  and  the  number  of  l’s  in  any  subset,  is  small.  The  best  known  randomized  (parallel)  algorithm 
achieves  a  discrepancy  of  >/A  log  n;  Raghavan  and  Spencer  applied  their  technique  to  this  algorithm 
to  obtain  a  deterministic  sequential  algorithm  with  the  same  bound.  We  show  that  the  randomized 
algorithm  requires  ^^■-independence  to  give  a  discrepancy  bound  of  A^2+f\/log  n  for  any  fixed 
f  >  0;  then  we  apply  our  techniques  to  convert  this  to  an  NC  algorithm  which  attains  the  same 
bound  (using  n],r  processors).  This  algorithm  has  many  applications.  As  an  example,  we  give 
a  deterministic  version  of  the  Karloff-Shmoys  parallel  edge  coloring  algorithm  [KS],  obtaining  the 
same  A  +  A’/2f'  bound  on  the  number  of  colors  used  ac  their  randomized  algorithm  and  beating 
the  known  deterministic  bound  of  2A  —  1  [1.2]. 

In  Section  3.  we  show  how  to  apply  our  techniques  to  a  large  class  of  problems  which  depend  on 
(logc  n )•  wise  independence.  In  particular,  we  describe  an  NC  algorithm  for  obtaining  the  expected 
value  of  any  function  which  i.->  the  sum  of  a  polynomial  number  of  terms,  each  depending  on  O(logn) 
binary  random  variables;  e.g,,  a  function  of  the  form 


A  )  —  'y  /,  (  A  ,,i , . . . ,  A  » )• 

i  =  i 
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Alternatively,  we  can  allow  the  terms  to  he  simple  functions  of  logc  n  random  variables;  e.g., 
characteristic  functions  of  affine  subspaces,  which  by  a  reduction  can  be  used  to  build  any  function 
which  is  non-zero  for  only  a  polynomial  number  of  points. 

Finally,  in  Section  4,  we  give  several  methods  for  extending  our  technique  to  multi-valued 
random  variables.  As  an  illustration,  we  consider  the  hypergraph  coloring  problem:  given  a  d- 
uniform  hypergraph  (V,£),  color  the  vertices  with  d  colors  so  that  at  least,  d\\£\/dd  edges  have  one 
vertex  of  each  color.  If  d  is  a  constant,  this  problem  can  be  solved  by  trying  all  sample  points  in  a 
c/-wise  independent  distribution  [ABI].  Using  our  techniques  for  handling  (logn)-wise  independent 
multi-valued  random  variables,  we  give  a  deterministic  NC  algorithm  which  solves  this  problem  tor 
all  d.  The  particular  technique  used  to  handle  the  multi-valued  random  variables  in  this  problem 
involves  generating  and  solving  a  series  of  problems  with  binary  random  variables,  highlighting  the 
importance  of  being  able  to  solve  a  large  class  of  these. 

Our  generalization  of  Luby's  work  to  remove  randomness  from  (logn)-wise  independent 
discrepancy-based  problems  such  as  edge  coloring  was  worked  out  in  November,  1988,  and  first 
appears  in  [B] .  Our  general  framework  for  removing  randomness  from  (logc  n)-wise  independent  NC 
calculations  was  worked  out  and  written  up  by  mid- December.  1988.  In  January,  1989.  Motwani, 
Naor.  and  Naor  [MNN]  obtained  our  results  for  discrepancy-based  problems  such  as  edge  coloring. 

2  An  Example  of  the  Method  —  Set  Discrepancy 

2.1  Definition  of  Problem 

Spencer  [S,  p.  30]  defines  the  sef  discrepancy  problem  as  follows.  Let  A  C  21  ,  \A\  —  | T]  =  n,  be  a 
family  of  finite  sets.  Let  \  :F— >  {  — 1,  +1}  be  a  2-coloring  of  the  underlying  points.  Define 

\(.1)  =  £\(0 

disci \)  =  max  |,\M)|. 

We  want  to  find  a  \  such  that  disr(\)  is  small. 

How  small  can  we  make  disc{\ )?  Spencer  [S,  p.  73-77]  shows  that  there  exists  a  \  with 
disc(x)  =  0(y/n).  He  also  shows  that  this  is  the  best  possible;  i.e.,  that  there  exists  an  A  such 
that  all  \  have  disc(x)  =  (>/»)• 

It  is  interesting  to  bound  discrepancy  in  terms  of  maximum  degree  A  =  max,4e_4  \A\.  Spencer’s 
lower  bound  can  be  easily  modified  to  give,  for  any  A,  an  A  with  cardinality  n  and  maximum 
degree  A  such  that  all  \  have  disc(\ )  =  Q(\/rA).  It  is  easily  shown  in  Section  2.2  that  if  we  pick 
X  at  random,  with  high  probability  disc{\)  will  be  at  most  2\/A  lg  n.  This  immediately  gives  an 
RNC  algorithm  achieving  that  bound.  Raghavan  and  Spencer  [Rag,S]  show  how  to  convert  this 
into  a  deterministic  poly-time  algorithm.  In  the  sections  to  follow,  we  develop  an  NC  algorithm 
which  outputs  a  \  with  disc(\)  <  A,/2+f \/lg n. 

An  interesting  special  case  of  the  set  discrepancy  problem  is  the  graph  discrepancy  problem. 
Given  a  graph  G  —  (l7,  FA.  we  want  to  find  a  2- coloring  of  the  vertices  \  ;  V  — ►  {  —  1 ,  -f- 1 }  such  that 
max„£i/  I  Eue.V(ir)  \(")l  *s  small.  We  can  reduce  this  problem  to  s--t  discrepancy  by  letting  F  =  V 
and  A  =  {Ar(f’)  j  v  6  U}.  Plugging  in  our  N<’  algorithm  for  di  crepa.icy,  we  will  gei  a  \  with 
maxvgv  I  Ene,v(i)  \(«)|  <  A’^+'y/Iogn,  where  A  is  the  maximum  degree  of  G. 
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2.2  An  RNC  Algorithm 

Consider  the  following  algorithm  for  set  discrepancy:  randomly  pick  x  until  disc(x)  <  2\/A  In  n. 
One  iteration  of  this  is  clearly  in  RNC.  We  will  show  that  the  expected  number  of  iterations  is  less 
than  two.  The  following  lemmas  will  prove  useful. 

Lemma  2.1  (S,  p.  29)  Let  X\, . . . ,  X&  be  independent  and  identically  distributed  with  Pr[Xt  = 
+  1]  =  Pr[Xi  =  -1]  =  1/2.  Let  S  =  £,  A,-.  Then  Pr[S  >  A]  <  e~x2/2A. 

Lemma  2.2  Pr  jdi,sc(vi  >  2\/  X  In  n|  <  2 jn. 

Proof  Straightforward  application  of  Lemma  2.1.  □ 

Thus,  the  expected  number  of  iterations  is  at  most  1/  ^1  —  <2.  So  the  above  is  clearly  an 

RNC  algorithm.  Also,  one  can  easily  show  using  Lemma  2.2  that  £[disc(x)]  =  2v/ A  In  n. 

2.3  The  Overall  Approach 

Lemma  2.2  shows  that  the  probability  of  disc(x)  being  larger  than  2\/A  In  n  is  small.  This  implies 
that  there  exists  a  x  with  disc(x)  <  2\/Alnn.  We  wish  to  find  such  a  x  deterministically.  Unfor¬ 
tunately,  the  random  construction  of  Section  2.2  assumed  a  fully  independent  distribution,  which 
must  have  2n  sample  points.  Clearly,  we  cannot  search  this  sample  space  exhaustively.  However, 
Raghavan  and  Spencer  [R,S]  developed  a  method  to  perform  a  binary  search  on  the  sample  space. 
While  they  achieve  a  polynomial  time  algorithm,  it  requires  n  decisions  to  be  made.  Since  each 
decision  depends  on  previous  ones,  it  seems  very  unlikely  that  these  decisions  could  be  made  in 
parallel.  To  get  an  efficient  parallel  algorithm,  we  must  work  with  a  smaller  distribution  space.  A 
natural  choice  is  a  k- wise  independent  distribution,  where  k  is  small. 

Definition  2.3  Xi,...,Xn  are  k-wise  independent  if  for  any  k-subset  of  Xj, ..  .,Xn  and  for  any 
ri,...,rk, 


pr[X ii  =  n  A  Xi2  =  r2  A  •  •  •  A  Xik  =  rk] 

=  Pr[Xh  =r1]Pr[Xi2  =  r2]  •  •  -Pr[Xik  =  r*]. 

Ideally  our  goal  is  to  find  a  x  with  small  discrepancy  by  finding  a  x  for  which  disc(x)  < 
E[disc(x)],  where  the  expectation  is  taken  over  a  fc-wise  independent  distribution  for  some  small 
k.  The  choice  of  k  is  influenced  by  two  factors: 

1.  if  k  is  too  small,  then  E[disc(x)]  might  be  too  large  and 

2.  if  k  is  too  large,  then  finding  a  x  which  achieves  the  expectation  takes  too  long. 

As  a  compromise,  we  will  eventually  choose  k  =  ,  e  >  0. 

There  are  other  problems  with  this  approach,  however.  Most  important,  to  find  a  good  Xi  we 
will  need  to  compute  expectations  of  disc(x)  conditioned  on  some  knowledge  of  the  distribution  in 
NC.  This  is  hopelessly  complicated  by  the  max  and  absolute  value  in  disc(x)-  To  get  around  these 
problems,  we  will  use  higher  moments,  and  use  |x(-4)|fc  as  an  upper  bound  on  disck(\).  For 

even  k,  this  gets  rid  of  both  the  max  and  the  absolute  value.  In  other  words,  we  will 

1.  show  that  E  |xM)l*]  is  small  for  suitable  k  where  the  expectation  is  taken  over  a 

&-wise  independent  distribution,  and  then 
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2.  find  a  \  such  tliat  HazA  l\(-4)|fc  <  E  p/ie.4  l\(^)lfc]  ■ 

As  a  consequence,  we  will  have  found  a  \  for  which  disck(x)  is  small,  and  thus  for  which  disc(x) 
is  small.  By  choosing  k  -  ,  we  will  produce  a  \  for  which  disc(x)  <  This  is 

not  quite  the  V'A  log 7;"  hound  we  got  with  the  RNC-  algorithm,  but  it  is  close. 


2.4  Bounding  the  Independence  Needed 

Our  first  task  is  to  bound  E  \  j2.ieA  Xfc(-4)]-  This  is  accomplished  in  the  following  lemmas. 

Lemma  2.4  Any  function  which  is  the  sum  of  functions  depending  on  at  most  k  random  variables 
each  has  the  same  expected  value  taken  over  any  distribution  with  k  or  greater  independence. 

Proof  Follows  directly  from  the  definition  of  k- wise  independence  and  linearity  of  expected  value. 

□ 


Lemma  2.5  For  any  even  k-ivise  independent  distribution  and  for  all  A  E  A, 

£[X*M)]  <  0((ki\/e)k'2). 

Proof  L'[\*(A)]  —  2  ikEr[\(A)  =  i].  Break  up  the  sum  into  blocks  of  y/K.  For  i  E 
1,...,v/a|,  ik  is  bounded  above  by  (\/a)  ;  for  i  E  [\/A  +  1,...,2\/Aj,  ik  is  bounded  above 


by  ^2\/Aj  ,  and  so  on.  7’hen. 

cc  oo 

=  i]  <  2  y^(rv//A)jcPr[(r  -  1)\/A  <  x(^)  <  ry/K] 

r=l 

OO 

<  2J2(rVK)ke~lr-^'2 


»=i 


r  =  l 

(where  the  last  inequality  is  from  Lemma  2.1).  The  terms  of  the  sum  geometrically  increase  to  a 
point,  and  then  at  some  large  r,  they  begin  geometrically  decreasing.  Use  the  ratio  test  to  get  a 
sense  of  this: 

(r  +  1  )st  term  _  l1  +  r)  _  jc/r-e(fc/r2)-r+t/2 
rth  term  e(2r-i)/2 

So  for  small  r  the  terms  are  growing  at  least  geometrically,  and  ior  large  r  they  decrease  at  least 
geometrically.  The  maximum  term  is  about  v.here  eklr~e^klT  ^ —  4-1  /2  1  =  e°,  which  occurs  at 

r  =  \fk  +  8{  1).  The  constant  will  go  away  in  the  O  notation,  so  we  will  ignore  it.  We  could  show 
that  this  ratio  will  be  bigger  than  e  or  less  than  3/e  more  than  one  or  two  terms  away  from  r  =  yfk. 
Hence, 

E[Xk(A) ]  =  0  ((y/ky/Zy  e^2)  =  0  • 

The  above  calculations  assume  a  fully  independent  (or  at  least  A-wise  independent)  distribution. 
However, 

xfc(d)  =  (52  *(*))*  =  52  53  ■■■  53  x(*i) •  - -x(**)- 

i€/l  iiGAiiSA  ikG/l 

So  Lemma  2.4  applies  to  show  that  any  fc-wise  independent  distribution  gives  the  same  expected 
value.  □ 
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Corollary  2.6  For  any  even  k-wise  independent  distribution, 

£[£  \fcM)]  <  nO((k&/e)k'2). 


A£A 


We  can  now  give  a  lower  bound  on  the  value  for  k.  We  warn  <  A1/,2+{\/]og  n; 

this  is  roughly  captured  bv  having  n^/k  <  A£.  This  implies  we  need  k  =  2  ^  ^  *s  l^us, 

and  we  are  able  to  fmd  a  \  such  that  YLa^a  \k(A)  is  at  most  its  expectation,  this  implies  that 


disck( X)  <  £  \k(A)  <  £[£  X*M)]  <  0(n(kA/e)k'2). 


.4e^t 


AeA 


So, 


< 

< 

< 


disc(x)  <  0(nllk^JkA/e) 

0(A£v/Iv/A) 

A1//2+£  s/logn. 

Remark  2.7  For  any  n  and  A,  we  can  construct  a  ( )  ~ u,ise  independent  distribution  and  a 
set  system  A  with  maximum  degree  A  such  that  E[disc(x)]  —  D(A).  Therefore,  to  get  disc(\ )  < 
A1/24-',  any  method  based  on  independence  alone  will  require  (  )  -wise  independence. 

The  next  three  sections  will  be  devoted  to  finding  a  \  suc^  that  12AeAXk(A)  is  at  most  its 
expectation.  If  we  let  \(i)  ~  (  — 1)*' ,  finding  a  x  is  identical  to  finding  an  Y  which  satisfies  this 
relation.  From  this  point  on,  we  will  let 

F(X)  =  -  £  \k{A)  =  -££•••£  (-1)X- +  •+Xl* . 

A£A  AeAiieA  ik£A 

So  henceforth  we  want  an  Y  such  that  F'(X)  >  E[F(Y)]. 


2.5  Generating  k-'Wise  Independent  Variables 

It  still  remains  to  demonstrate  a  k- wise  independent  distribution  on  which  we  can  perform  a  binary 
search  efficiently  in  parallel. 

Luby  [L2]  gave  the  following  such  probability  space  for  the  case  k  =  2.  Let  /  =  flog71]  +  1  an^ 
u  be  picked  uniformly  from  Z\.  For  each  point  i,  let  <*(,...,  i/_i>  be  the  binary  expansion  of  i. 
Define  random  variables  Yj, .  . . ,  Xn  so  that 

(!=l  ) 

A,  =  ^  +  u>/  mod  2. 

v=>  J 

Observe  that  Luby’s  distribution  is  not  4-wise  independent  since,  in  particular,  Y4,  A'5,  A'e,  and 
X7  are  dependent. 

We  extend  Luby’s  distribution  to  be  /c-wise  independent  for  all  k  as  follows.  We  will  assign 
a  label  a,  6  Z\  to  each  point  i,  where  /  is  bounded  by  some  polvlogarithmic  function  of  n.  Pick 
u>  €  /?2  uniformly  at  random,  and  let 

Y,  =  a,  •  u. 

Note  that  we  can  express  Luby’s  distribution  in  this  framework  by  leting  a,  =<  ij , . . . ,  1 ,  1  >. 

The  main  benefit  of  our  distribution  is  that  we  can  now  give  a  necessary  and  sufficient  condition 
for  the  Y,’s  to  be  independent  and  unbiased.  The  following  result  is  similar  to  others  in  the  literature 
[MS,ABf]  and  can  probably  be  considered  to  be  well-known. 
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Theorem  2.8  A', X,k  are  independent  and  unbiased  if  and  only  if  are  linearly 

indejyendent  as  vectors  over  Zi- 

Proof  Utilizes  elementary  linear  algebra.  □ 

To  get  A'j, . . . ,  A'„  which  are  (log  7i)-wise  independent,  we  need  a  set  of  labels  at,  ■  . . ,  an  such 
that  every  log  n  of  them  are  linearly  independent.  (By  Theorem  2.8,  this  gives  us  logn-wise 
independence  of  the  A\’s.)  In  fact,  it  suffices  to  get  an  n  x  r  matrix  over  GF(23)  with  the  property 
that  any  log  n  rows  are  linearly  independent.  Letting  a,  be  the  ith  row  with  each  element  Qo  + 
Qix  +  •  •  •  +  a,_] x3_1  €  GF( 23)  expanded  out  to  <  o0,  •  •  .,0,-1  >  gives  length  l  =  rs  labels  such 
that  any  logn  are  linearly  independent  over  Z2.  Several  well-known  ways  of  getting  such  matrices, 
for  /  =  0(log2n),  are  described  in  [ABI.  Rah].  Even  randomly  chosen  labels  of  this  length  will 
work. 

Theorem  2.9  For  a  random  set  of  labels  {<Zi , . . . ,  an}  Q  Z^logi  n ,  Ai , . . . ,  Xn  are  (logn)-wise 
independent  with  high  probability. 

Proof  Omitted.  □ 

Since  any  probability  space  with  n  k-  wise  independent  random  variables  must  contain  Q((n/ k)\-kl2i) 
sample  points  [ABI,CGHFRS],  the  labels  at, . .  ,,an  must  be  fl(log2  n)  bits  long. 

One  final  thing  to  note  is  that  we  can  easily  extend  the  above  to  give  (logc  n)-wise  independence 
for  any  c  by  using  the  same  techniques  to  obtain  labels  with  /  =  O(logc+1  n),  also  matching  the 
lower  bound. 

2.6  Zeroing  in  on  a  Good  Sample  Point 

Now  that  we  have  a  k- wise  independent  distribution,  we  will  explain  how  to  do  a  binary  search  on 
it  efficiently  in  parallel.  We  will  use  the  strategy  introduced  by  Luby  in  [L2]  for  zeroing  in  on  a 
“good”  sample  point;  i.e.,  an  u  such  that  F(X)  >  F[r(A)]. 

To  zero  in  on  a  good  w,  one  bit  of  u  is  determined  at  a  time,  thereby  performing  a  binary 
search  on  the  u/s.  This  is  done  as  follows.  At  the  beginning  of  iteration  t,  assume  we  have 
set  lji  =  si , . .  . ,  wt_i  =  .  Then  we  compute  E[F(X)  |  =  si , . . .  =  s(_i  ,w(  =  0]  and 

E[F( X)  |  u>i  =  sj. ...  ,ut_\  =  st~ i,u>t  =  1].  We  then  set  ut  to  the  st  €  {0, 1}  which  maximizes 
E[F(X )  |  u\  =  si, . . .  ,w(_i  =  S{_i,u;t  =  st].  We  will  show  how  to  compute  these  conditional 
expectations  in  Section  2.7. 

Lemma  2.10  A  fter  step  t  of  the  above  procedure,  E{  F(X)  |  uq  =  sj , . . .  ,u><  =  st  ]  >  £’[F(A')]. 

Proof  (by  induction  on  |s|) 

The  case  |s|  =  0  is  clearly  true.  Assume  this  lemma  is  true  for  t  —  1;  i.e.,  we  have 
E[  F(X)  |  wi  =si, . .  =  st_,  ]  >  F[F(A)].  Then 


E[  F(X) \u1=si,...,L>t  =  st] 

-  max(F[  F(X)  |  =s,,. .  =st-i,vt  =0],  E[  F(  X)  \  u>i  ~su  . . .  ,ut-i  =at_i,wt  =  1]) 

>  (E{  F(X)  I  uq  =«,,. .  .,u>t  _!=«<_,,  o>t  =  0],F[F(  A')  |  ui  =st, ...  ,^t-\  =«<-!,  w(  =  l])/2 
=  E[F(X)  |  uji  =su...,ut~i  =st-i ) 

>  F(F(A)]  (by  inductive  hypothesis).  □ 

Corollary  2.11  The  output  of  the  above  procedure  is  an  X  such  hat  l  ( X)  >  F[F(A)J. 
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2.7  Computing  Conditional  Expectations 

In  geiural.  computing  conditional  expectations  is  hard  to  do  and  separates  when  one  can  and 
cannot  use  Raghavan/Spenrer-liko  techniques  to  zero  in  on  a  a  good  sample  point.  Fortunately, 
in  the  case  of  discrepancy,  we  have  devised  a  simple  and  efficient  approach  for  computing  condi¬ 
tional  expert at j, ins.  Recall  that  to  solve  discrepancy,  we  need  to  compute  conditional  expectations 
F(  l'{  A  )  I  —  S| . -  .s,  ]  where  /•'(  A  )  is  of  tin*  special  form 


/■'(  .V  i 


£  L  •••  E(-dx,,+ "+A^- 


1  sing  linearity  of  expected  value,  we  can  brrak  tins  up  into  components 

=  .*■•'[(- 1  |  uq  =«i, - 

-  F[(-n^=.a,/w  1^  =  .! . «,,  =  *] 

=  h [(  —  1 )  !  ^‘i  =  -Si . . . . ,  =  .s(] 


where  a  =  nt).  Let  r  he  the  last  position  vhich  contains  a  1  in  a.  If  /  <  r,  then  a  ■  u,’  is 

unbiased,  and  therefore  ft,,  ..,k(.s)  =  0.  Otherwise,  a  -us  is  the  same  for  all  uj  which  extend  and 
hence  ft,,  ,k  =  /•.'[(- 1  j'7'-1'].  Assuming  we  have  precomputed  a  and  r,  we  can  compute  ft, fc(s)  in 
constant,  time  by  extending  a  partial  sum  nj  sj  at  each  iteration  and  outputting  h^...,k(s)  =  0 

if  /  <  r  and  ft, s)  =  (  - 1  'h  if  t  >  r. 

To  compute  F[F(. Y)  |  u-q  —  sj . -  .st],  we  need  one  processor  for  each  possible  < 

zl.ii . ifc  >.  that  is.  at  most  tiA^  total.  Therefore,  k  must  be  Letting  k  be  the 

minimum  possible,  2 [ 2 ‘toga  1  ’  ‘ i*1  pi ios  that  n3+Of  processors  is  sufificient. 

Then  we  can  compute  all  ft;,  (s)  terms  in  parallel  in  constant  time  and  sum  them  to  get 

E[F(X)  |  =.S) . ut  =  st }  in  O(logn)  time.  Thus  we  spend  0(1  logn)  time  in  the  /  iterations 

of  our  procedure.  In  addition,  we  can  perform  the  precomputation  required  above  in  0(1  logn) 
time  as  well.  Since  I  —  O(log2  n),  this  yields  an  O(log3  n)  algorithm  for  discrepancy. 

2.8  Application  to  Edge  Coloring 

An  edge  coloring  at  a  graph  (J  —  (V-,  F)  is  an  assignment  of  colors  to  all  edges  of  the  graph,  so  that 
any  two  edges  that  share  a  common  vertex  are  assigned  different  colors.  Let  A  be  the  maximum 
degree  in  (j.  Observe  that  any  coloring  requires  at,  least  A  colors.  In  fact,  Vizing  [V]  implicitly  gives 
a  polynomial  time  algorithm  to  A  +  1  color  any  graph.  KarlofT  and  Shmoys  fKS]  provide  a  parallel 
implementation  of  this  algorithm  to  get.  a  A  +  1  coloring  of  any  graph  in  time  0(  A0*1'  log0*1*  n) 
using  a  polynomial  number  of  processors.  Also  of  interest,  there  exist  NC  algorithms  for  optimally 
coloring  bipartite  graphs  with  A  colors  [GK.LPV,CTI,AIS].  Furthermore,  there  is  a  trivial  SC 
algorithm  to  2A  -  1  color  any  graph  by  A  +  1  vertex  coloring  [L2]  the  line  graph.  Berger  and  Shor 
[BS],  Karloff  [K ] ,  and  Naor  JN]  found  NX'  algorithms  to  A  +  A/log^^n  color  any  graph. 

Of  particular  interest  here,  there  is  an  RNC  algorithm  in  [KS]  which  A  +  A,,/2+'  colors  any 
graph.  We  will  remove  the  randomness  from  this  algorithm  by  using  the  techniques  discussed 
above.  I  fie  RNC  algorithm.  Algorithm  A,  is  as  follows: 

1.  If  A  <  flogjj)1^,  then  use  the  [KS]  A  F  I  deterministic  algorithm. 


2.  Run  an  RNC  algorithm  for  graph  discrepancy,  randomly  picking  x  until  disc{x )  <  A1/,2+f. 
Let  A  =  {e  |  \(iA  =  +1}  and  B  =  {v  |  x(v)  =  -!}•  This  partition  gives  us  two  graphs,  both 
with  vertex  set  V:  bipartite  graph  C\,  which  has  all  the  edges  of  G  between  A  and  B\  graph 
G2  which  has  all  the  other  edges  of  G. 

3.  Color  Gi  using  the  A  coloring  algorithm  for  bipartite  graphs. 

4.  Recurse  on  G2,  using  a  new  set  of  colors. 

This  algorithm  works  because  the  above  partition  implies  that  both  G\  and  G 2  have  maximum 
degree  at  most  A/2  +  A1/2*'. 

To  make  Algorithm  A  deterministic,  we  need  only  demonstrate  a  deterministic  method  for  graph 
discrepancy,  which  we  said  in  Section  2.1  was  a  special  case  of  the  set  discrepancy  problem.  Plugging 
in  the  set  discrepancy  results  with  e'  =  c/2,  we  get  a  x  such  that,  disc(x)  <  A!/2+t  v/log  n.  Note 
that  A  >  (logn)1/',  since  we  handled  the  other  case  in  Step  1  of  Algorithm  A.  Thus,  %/log  n  <  Af'2. 
So  we  have  disc(x)  <  A1/2*',  which  implies  x(N(v))  <  A1/,2+e  for  all  v  €  VC 


3  Setting  up  a  General  Framework 

For  discrepancy-based  problems,  we  considered  a  very  specific  class  of  functions,  namely  those  of 

the  form  A,  J,  and  showed  how  to  achieve  the  expected  value  for  these.  What  can 

we  do  in  general?  In  particular,  for  which  functions  can  we  compute  conditional  expectations  (the 
method  of  Section  2.6  will  then  apply  to  achieve  the  expected  value)?  In  order  to  give  ourselves  a 
fighting  chance,  we  will  restrict  our  attention  to  functions  of  the  form 

These  are  exactly  the  functions  for  which  we  can  apply  Lemma  2.4  to  show  that  fc-wise  independence 
gives  the  same  expected  value  as  full  independence.  Since  we  will  require  at  least  one  processor  for 
each  /,  term,  we  will  require  m  to  be  polynomial  in  n.  In  Section  3.1,  we  will  show  how  to  compute 
conditional  expectations  for  arbitrary1  /;  when  k  =  O(logn).  In  Section  3.2,  we  will  describe  the 
/,’ s  for  which  we  can  handle  the  case  k  =  O(logcn). 

3.1  Logarithmic  Number  of  0/1  Variables 

In  this  section,  we  will  present  two  different  methods  for  computing  conditional  expectations  for 
functions  of  the  form 

n° 

F(X)  -  y^J  fi(X  1.1)  •  •  •  1  V i,b  log  n  )  • 
t=l 

We  present  both  methods  because,  depending  on  what  problem  we  wish  to  solve,  either  of  the  two 
methods  may  be  more  efficient. 

The  first,  method  is  to  rewrite  F  to  be  of  the  form  solved  in  Section  2.7.  Let  g  :  2'1 . "R 

be  such  that 

g{A)  =  /.(A',,1, . . .,  A',,*.)  such  that  ,YtJ  = 

(i.e.,  g  of  a  set  is  /,  applied  to  its  characteristic  vector).  The  following  lemma  follows  from  the 
theory  of  harmonic  analysis  on  the  cube. 

lIn  fact,  wc  will  require  a  uniformity  condition,  but  will  ignore  this  issue  for  now. 


f  1  if  ;  G  A 
{  0  if;  <t.A 
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Lemma  3.1  (KKL)  g(A)  =  Zsc{i . *}  «s(-l)|5rM|,  whereas  =  2~kZBc{i . *}  g(B)(-1)\SnB\. 

Thus, 

fi(XiA,...,Xitk)  =  g{{]\  A',,j  =  1}) 

=  ^2  Qs(  —  (by  Lemma  3.1) 

s 

s 

Since  we  have  now  written  F  is  /»,■(  — 1  we  can  apply  ihe  technique  of  Section  2.7  to 

compute  conditional  ex>ectat  ons.  This  gives  us  the  following  theorem: 

Theorem  3.2  There  is  an  NC  algorithm  which  given  any  F  :  Z™  —*  R  of  the  form 

na 

F(X)  =  J2MXnU...,x,Mosn), 

1=1 


outputs  an  X  with  F( X)  >  £[F(A')]. 

An  alternative  method  for  computing  conditional  expectations  for  F  is  as  follows.  First  note 
that,  by  linearity  of  expectation,  it  suffices  to  compute  the  conditional  expectations  of  the  individual 
/,  and  sum.  Assume  we  wish  to  compute  E[  fi{Xi,\,  •  •  ■ ,  ^a.ftiogn)  |  u?i  =  Si, . . .  =  a*  ].  Let  x  be 

the  vector  <  A',i . xi,biogn> ,  and  let  A  be  the  matrix  whose  rows  are  the  corresponding  labels 

Mogn-  Then  x  =  Au.  So 

E[  /.( A',.i, . .  • ,  A'^Mogn)  I  U>1  =Si, . .  .,W(  =  5(  ]  =  ^2f,(x)Pr[  Au  =  x  \  u>i  =  Si , . . .  ,u>t  =  st  ]. 

X 

If  we  let  u/  =<  u\ , . . .  ,ujt  >,  u"  =<ui+1 , . . .  >,  A'  and  A"  be  the  first  t  and  last  l  -  t  columns 

of  A  respectively,  and  s  =<  sj, . . . ,  st  >,  then 


E[fi(x)  !  uq  =s1,...,W(  =  «<]  =  '}Tfi(x)Pr[A'u>'  +  A"u>"  =  x  \  J  =  s] 

X 

—  ^  fi(x)Pr[  A"u "  —  x  —  A's]. 

X 

For  each  i,  we  can  test  if  the  linear  system  A" u"  =  x  -  A's  is  solvable;  if  it  is  Pr[  A"u>"  =  x  —  A's  ]  = 
2rank(/t'  )+t-l,  otherwise  Pr{  A" u"  =  x  -  A's  ]  =  0.  Since  we  can  compute  the  contribution  of  each 
of  the  x’s  in  parallel,  we  can  compute  the  desired  conditional  expectation  in  NC,  thus  giving  an 
alternate  proof  for  Theorem  3.2. 

3.2  Polylogarithmic  Number  of  0/1  Variables 

Now  we  will  consider  the  case  of  functions  depending  on  a  polylogarithmic  number  of  variables.  A 
simple  counting  argument  shows  that  in  NC  we  cannot  compute  all  functions  of  logc  n  variables, 
when  c  >  1,  let  alone  compute  conditional  expectations  of  them.  In  fact,  both  techniques  of 
Section  3.1  require  evaluating  /,  at  every  point;  if  /,  depends  on  more  than  a  logarithmic  number 
of  variables,  there  will  be  a  superpolynomial  number  of  points  to  evaluate.  However,  there  arc  some 
special  cases  for  which  we  can  compute  conditional  expectations. 
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T  he  first  special  case  we  can  handle  is 


MX.J . = 

This  function  can  L>e  evaluated  using  the  techniques  of  Section  2.7,  even  if*  =  log^n.  In  Section  3.1, 
we  showed  how  to  transform  any  function  into  a  linear  combination  of  these;  if  this  transformation 
is  already  known  and  provides  only  a  polynomial  number  of  non-zero  ads,  we  can  use  this  technique. 
The  next  special  case  will  be  based  on  the  second  technique  of  Section  3.1.  Recall,  we  had 

£[/■(  -V..i . -V,.*)  |  iJr  =  «i,...,  u/,  =  st]  =  Y^Mx)Pr[A"u"  =  x-  A's  ]. 

X 

We  can  restrict  our  attention  to  those  x  for  which  f,(x)  ^  0.  If  there  are  a  polynomial  number 
of  these,  we  can  compute  conditional  expectations  of  /,  for  k  —  log0  n.  Some  examples  of  this  are 
logical  AND  and  NOR  of  a  polylogarithmic  number  of  variables  (each  has  one  non-zero  point). 

A  variant  of  the  above  is  the  case  of  monomials,  /,(Aqi, . . . ,  A',,*)  =  ATqiA',^  •  •  •  X,k.  This 
is  equivalent  to  logical  AND  described  above.  Note  that  handling  monomials  is  strictly  weaker 
since,  for  example,  it  is  impossible  to  write  a  poly-log  variable  NOR  as  a  linear  combination  of  a 
polynomial  number  of  monomials. 

Finally,  we  give  a  type  of  /,  which  can  simulate  all  the  above  and  more.  Consider  functions  of 
the  form 

f  1  \{  x  ~  y  +  T z  {or  some  2  6  Z| 

'  X  (  0  otherwise 

for  some  y  6  Z% ,  T  €  Z^*  (which  may  be  different  for  different  /, ). 

These  are  the  characteristic  functions  of  affine  subspaces.  Included  are  characteristic  functions 
of  all  single  points;  we  can  write  any  function  with  a  polynomial  number  of  non-zero  points  as 
a  linear  combination  of  these.  Functions  (-l)^-o  X,  J  can  also  be  put  in  this  form.  To  compute 
conditional  expectations,  we  use  a  variant  of  our  linear  algebra  method: 

E[fi(x)  |  ui=si . w,  =  s(]  =  'jT,fl(x)Pr[A*>  =  y  +  Tz\u'  =  s] 

r 

=  ^/,(i)Pr[/w"  +  'f2  =  j-/  I'a] 

T 

which  can  be  computed  by  performing  Gaussian  Elimination  to  determine  how  many  bits  of  u"  are 
free  to  vary.  This  gives  us  the  following  theorem: 

Theorem  3.3  There:  is  an  NC  algorithm  which  given  any  F  :  Z%  — *  R  of  the  form 

na 

f  (  A  )  =  ^  *  ft  ( A ,  i , . . . ,  A  ,■  (,  i0gc  n ) , 

1=1 

where  each  ft  is  the  rhararteristir  function  of  some  eiffine  subsjxice  of  Z^°s  n,  outputs  an  X  with 

F(X)  >  £[/•’( A')]. 


4  Handling  Multivalues  —  the  Hypergraph  Coloring  Problem 

In  the  previous  sections,  we  were  only  concerned  with  the  rase  where  the  random  variables  took 
on  values  0  and  1  each  with  probability  1/2.  Yet,  for  many  problems,  this  model  is  too  restrictive. 
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In  this  section,  we  expand  o  ir  framework  to  consider  random  variables  drawn  from  a  uniform 
distribui  ion  over  a  large  set  o'  values.  1'his  can  i  hen  be  used  to  simulate  non-uniform  distributions. 
We  will  demonstrate  our  techniques  for  handling  multivalued  random  variables  on  the  following 
problem: 


A  hypergraph  H  —  ( V ,  £ )  is  a  system  €  of  subsets  of  V  called  edges.  H  is  d-uniform  if  every  edge  has 
d  elements.  Kleitman  and  Alon,  Babai,  and  Itai  [Kl,  ABI]  define  the  large  d-partite  subhypergraph 
problem  as  follows.  Given  a  d-uniform  hypergraph  H  =  (V,  £),  find  a  d-coloring  of  V'  such  that  the 
number  of  edges  in  £  having  precisely  one  vertex  of  each  color  is  at  least  jj^dl/d^J .  [ABI]  showed 
this  problem  is  in  NC  for  constant  d.  We  will  show  in  this  section  that  this  problem  is  in  NC  for 
all  d.  Since  the  case  of  d  >  In  |£|  +  £7(lg lg  |£|)  is  trivially  satisfied  by  any  coloring  that  colors  one 
hvperedgc  correctly,  we  will  henceforth  restrict  our  attention  to  the  case  d  <  In  |£|  +  0(lglg  |£|). 


4.1  Randomize J  Algorithm 

In  this  section,  we  give  a  randomized  parallel  algorithm  and  prove  that  the  expected  number  of 
properly  colored  edges  is  as  desired.  The  randomized  algorithm  is  as  follows.  Randomly  assign  to 
each  vertex  an  /  =  31og|£|  bi 1  label.  Designate  these  as  random  variables  Y  —  Y\ , . . . ,  l|p|.  Let 

p  =  |^2,/d| .  A  vertex  is  mapped  to  color  i  if  its  label  is  in  C,  =  {(i  —  l)p, . . . ,  ip  —  1}.  Note  that  every 

color  has  p  values  associated  with  it.  Vertices  with  values  in  the  range  dp  to  2*  -  1  are  uncolored. 
Note  that  fewer  than  d  of  the  21  possible  values  yield  an  uncolored  node. 

Now  for  the  analysis.  We  define  a  benefit  function,  G(Y ),  which  will  be  the  sum  of  terms  ge(Y ), 
one  for  each  e  €  £.  Each  ge(Y)  will  be  1  if  the  vertices  of  edge  e  are  assigned  d  different  colors, 
and  0  otherwise.  In  calculating  the  expected  value  of  ge(Y),  we  get 


E[ge(Y)} 

>  Pr[ge(Y)  =  1  |  no  vertex  on  edge  e  gets  non-color]  -  Prfsome  vertex  on  edge  e  gets  non-color] 

d[  _ 

~  dd  2‘ 
d\  d2 

=  d*  ~  |£P  ^S'nCe  1  =  3  log  I™’ 


Then, 


E[G(Y)}  =  X>[<7e(K)J  >  > 


-  1. 


In  fact,  since  G(Y)  is  integral,  we  know  that  G(Y)  >  £[G(T)]  implies  that  G(Y)  >  ,  which 

is  exactly  what  is  desired. 


4.2  The  Basic  Approach 

In  this  section,  we  discuss  various  approaches  for  determinizing  algorithms  which  use  multivalued 
random  variables.  These  approaches  have  different  advantages  and  disadvantages  and  may  all  prove 
useful  in  applications. 

The  easiest  approach  to  handling  functions  of  multivalued  random  variables  is  to  represent 
each  variable  by  a  collection  of  boolean  random  variables.  In  particular,  for  the  large  d-partite 
subhypergraph  problem,  if  d  is  a  power  of  2,  d  =  0(log |£|/ loglog |£|),  we  can  represent  the 
color  of  each  vertex  by  Igd  boolean  random  variables.  Then  each  gt  would  become  a  function  of 
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d\gd  =  0( log  |£|)  boolean  variables,  allowing  us  to  apply  the  general  framework  of  Section  3  to 
find  a  good  coloring. 

A  second  approach  we  might  consider  would  be  to  replace  Zi  in  our  distrubution  with  some 
other  finite  field  GF(q).  For  the  large  d-partite  subhypergraph  problem,  if  d  is  any  prime  power, 
d  =  O(log \£ |/ loglog \£ |),  we  can  replace  Z^  with  GF(d).  Theorem  2.8  will  still  hold  and  all  of  the 
approaches  to  get  labels  can  be  easily  modified  to  work,  giving  us  a  distribution  with  (logn)-wise 
independent  random  variables  uniformily  distributed  over  GF(d).  Since  d  is  small  (we  only  require 
polynomial  in  n),  we  can  try  each  possible  value  for  the  next  element  of  u  in  parallel  and  pick  the 
one  with  the  best  conditional  expected  benefit  G.  To  evaluate  the  conditional  expectations,  we  can 
still  use  the  linear  algebra  method  of  Section  3.1  to  find  the  probability  a  collection  of  d  random 
variables  take  on  some  particular  value.  And  we  can  do  this  for  each  possible  value,  since  dd  is 
polynomial  in  n.  Thus  we  can  still  efficiently  zero  in  on  a  good  sample  point. 

To  find  a  good  coloring  for  any  d  up  to  log|£|,  we  must  use  a  more  complicated  approach,  one 
which  is  similar  to  the  one  used  by  Luby  for  A  +  1  vertex  coloring  [L2].  In  essense,  we  repeatedly 
use  the  0/1  problem  as  a  subroutine  to  set  one  bit  of  the  random  variables  at  a  time.  We  have 
multi  valued  random  variables  Y  —  Vj,...,  *V\  where  Y)  =  Y)2  -  Yu.  We  will  compute  the  Vi’s 

bit  by  bit.  At  step  t,  we  will  compute  X^  such  that 

E[G{Y)\Yij  =  Xjj)  for  1  <j<t]  >  E{G(Y)\Ytj  =  x\l}  for  1  <j<t-  1], 

If  we  let 

F(t>(X(t))  =  E[G(Y)\YtJ  =  Xjj)  for  1  <j<t], 
then  the  above  is  equivalent  to  finding  an  X W  with  F^(X^)  >  f^F^X^)].  Letting 

/i°(*(,))  =  E[gc{Y)\YtJ  =  X,0)  for  1  <j<t] 

allows  us  to  write  F(t>( X^)  as  a  sum  of  \£\  functions,  each  depending  on  at  most  d  <  lg  \£\  random 
variables  xj‘K  Assuming  that,  given  A'*1), . . . ,  X^~l\  we  can  construct  functions  /e(t)  (we  will  show 
how  to  do  this  in  the  next  section)  we  can  find  a  good  A"*!)  using  the  general  framework  of  Section 
3.1. 

A  simple  inductive  argument  shows  that  for  all  t, 

E[G(Y)\Yij  =  X}J)  for  1<j<<]  >  £[G(K)]. 

It  follows  that  letting  Y  be  such  that  YtJ  =  xj^  for  all  i  and  j  implies  that  G(Y)  >  E[G(Y)}. 


4.3  The  Deterministic  Algorithm 

To  apply  the  last  multivalued  approach  described  in  the  previous  section,  we  must  show  how  to 
construct,  for  any  t  and  for  any  settings  of  the  first  t  -  1  bits  X^1), . . . ,  A'^-1),  functions 

Y">)  =  t-l'MY ,\YtJ  =  X,fj)  for  1  <  /  <<]. 

To  do  this,  we  will  show  hex  to  compute 

F[f)e(Y  YXj  =  \\<J)  for  I  <j<t}- 

it  then  suffices  to  plug  in  the  given  X(l\ . . . ,  X<,-1>  and  every  possible  setting  of  the  variables 
{A'/^Ji  €  (}  to  construct 
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Given  edge  e  and  the  first  t  bit?  of  each  label,  /j^  will  be  the  probability  that  the  edge  is 
properly  colored.  To  calculate  fi‘\  we  sort  the  vertices  of  edge  e  into  groups  having  the  same  /-bit 
prefix.  For  each  t  bit  string  a,  we  let  Sa  be  the  set  of  vertices  which  have  prefix  a  and  let  la  be  the 
set  of  2l~{  values  which  have  prefix  a.  We  let  Ta  be  {1  +  X^0<a  l^li  •  •  • »  52f3<c,  |5/j|}.  Observe  that 
edge  e  will  be  properly  colored  if  and  only  if  for  each  a  the  vertices  in  Sa  are  assigned  the  colors 
in  Ta. 

Now  we  can  calculate  f\^  as  follows: 

/<f)i  -v(,))  =  n  Pr[  vertices  in  Sa  are  assigned  colors  in  Ta } 

=  n  i^i!  n  Pr[  vertex  in  Sa  gets  color  i  ] 

|or|=j+l 

=  n  mit  L|3#1- 

M=j+1  i€  Ta 

Theorem  4.1  The  large  d-partite  subhypergraph  problem,  finding  a  coloring  of  V  which  properly 
colors  at  least  |j£|d!/ddJ  edges,  is  in  NC.  In  fact,  a  more  careful  analysis  shows  that  finding  a 
coloring  of  V  which  properly  colors  at  least  \£\d\/dd  edges  is  possible  in  NC. 

5  Remarks 

There  are  many  other  problems  for  which  discrepancy  can  be  used  to  obtain  a  solution.  It  is 
interesting  to  observe  that,  although  high  independence  is  required  for  discrepancy  (Remark  2.7),  for 
other  problems  (e.g.  set  cover  type  problems)  where  it  might  appear  one  needs  it  [MNN2],  pairwise 
independence  suffices  [BRS].  This  makes  a  real  difference,  because  with  lower  independence,  it  is 
possible  to  use  far  fewer  processors,  and  even  approach  an  optimal  processor-time  product. 

As  a  final  note,  the  discrepancy  algorithm  of  Section  2  can  be  improved  to  yield  a  2\/Alog n 
bound  in  the  special  case  A  =  logcn.  The  improved  algorithm,  besides  achieving  a  better  discrep¬ 
ancy  bound,  is  a  nice  example  of  how  to  apply  the  techniques  of  this  paper.  The  basic  approach 
is  as  follows.  For  each  A  €  A,  we  let  ^(A”)  be  0  if  |x(A)[  <  2\/A logn,  and  1  otherwise.  Let 
G(X )  =  Y,AeA  3a(X),  i.e.  G(X)  is  the  number  of  unbalanced  edges.  We  want  to  find  an  X  such 
that  G(X)  <  £[G(X)]  <  1.  To  do  this  we  first  partition  T  into  a  poly-log  number  of  subsets 
rj,...,rr,  such  that  the  intersection  of  any  with  any  A  G  A  is  O(loglogn)  (this  can  be  done 
using  0(log log  n)- wise  independence  combined  with  some  techniques  from  [BRS]).  Then  for  each 
j  in  sequence,  we  can  construct  a  function  F^A(X^^),  where  are  the  random  variables  corre¬ 
sponding  to  Tj,  which  is  the  expected  value  of  G(X)  conditioned  upon  the  values  of  X^\  .... 
set  already  and  the  given  X^K  Each  F(A  will  be  a  sum  of  functions  depending  on  O(loglogn) 
variables  each,  so  we  can  apply  our  general  framework  to  find  a  good  X^K  A  simple  inductive 
argument  shows  that  when  we  are  done,  we  have  a  good  X .  The  number  of  processors  required  for 
this  algorithm  is  nlog^’^n. 
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